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Abstract 

. «f tv,^ nioital Imaeine and Communication in Medicine 
Structured Reporting (SR), a ^^PP^^^V^^'^" ^^^[^^ The format of these SR reports is 
(DICOM) standard, specifies a fndard form for m^p^^^ed and viewed in the DICOM 

DICOM and therefore they can be readily ^t^^^^^^ ^^^f^"^^; ™ ^ the non-DICOM world. The 
world. However, it is not convenient to ^'^'^^^^f. ^^r^*'^/^^^^ Wide Web Consortium (W3C), 
Extensible Markup Language (XML) a ^P«^^^°^„^^^^^ an ideal solution for representing 

specifies a universal format for strucmred ^^m^t^^ ^^orid, provided that an engine is available to 
id viewing DICOM structured reports m the ^^^'^^^^^rrmat Such an engine has been built on top 
translate these reports ^^^2!*^^^^^^^ (XSLT) components. It is 

Si'^srSri" 

show that the engine works well. Iv be built into a server 

The current prototype engine is a standalone appUca^^^^^^ ,,^,l,te 
component in the DICOM world to connect to the non-DICOM wo 
DICOM messages other than structured reports. 
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1. Introduction 



ia to particular tamges and/or teT^M bU foLa. and ca« b. stored, 

users ofSRcanb.radio.ogist,..^c,aj.p^Uja^^^^^^ 

distributed ai»l using different systeins. ^'''^^^^,,^^"l^^±,^aity. Ms report focuses 

r;^nriS^X"-"~ 



v/Aj. " iT 

technoloevr?!, or by other means. 

technology L/J, fnr DTCOM is limited to certain specialties, primarily those dealing 

allowing medical reports to be accessed and viewed m the non-DICOM world. 

. , » .oii«i nirOMlXML which takes a structured report m DICOM 

TewaS^L technology into healthcare information technology domam. 

2. Application Scenario 

Figure 1 shows an application scenario DIC0M2XML J^e left^^^ 

en^onment with modalities and DICOM archives for -^f ^^^^^"^^^ The right half 

system (PACS), where structured reports are ^^^^^^f ' ^J^f^^^^^^^^ desire to have access to the 
/epicts an environment wi^ l^^^/ig^^^^s^^^^^^ reports from DICOM format 

information in DICOM archives. DIC0M2XML ^^^^ j^J^^^ ^^^^y of devices ranging from 
to XML format. The resulting XML docum^ts can ^^.f ""^J^^^^^^^ clinical physicians to 
workstations to portable hand-held ^-^^ J^X^^^^^^^^^^ (e.^. PACS) from 

?ooS TOs allows enterprise-wide access of the structured mformation. 
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Figure 1. Application «e,«rio of DICOMIXML W«sl.t.r 

messa^ in the future, for exa,»ple. *P°»S>t,dSS healthcare, an enterprise-wide 
inclosing development of telemedicme, WebMD „„! med more in the DICOM 

distribuUon of medical mformatton But Operation amottg physicians, clmrcans, or 

world where image acqmsttton plaj^ ^„"»°^l!°j^cal r^orts where DICOM is not m common 

Xld Ss DIC0M2XML translator provides such a functton. 



3. Design Decisions 

There areanun^erofdesignscenariosinhuildingthistransla^ 

. Oneistoparse^einp^DlCOMh^^^ 

document type defimtton (DTD) usmg an P ^^^.basedonte 

. Another is to parse the mput l^KOM b^ Be a^ aSv - XSL sty.eshe« using an 

.the^sclUeOlCOM^*-^^^^^ 

.•_*u=.^,^♦vYl^/^T. DOM tree. 



Ute^o^ places in theWy XML DOM tree. 
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Figure 2. Software components and data flow diagram, scenario I 



In the second scenario, the DICOM parser parses the DICOM binary file and the raw XML builder 
dumps DICOM information to a new XML document with simple structure. Then an XSLT engine, with 
a SR XSL stylesheet, transforms this raw XML document into a SR XML document. 
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Figure 3. Software components and data flow diagram, scenario n 

A DICOM parser is needed in both cases. The first case requires only two steps, but the SR XML builder 
will need to incorporate coding logic dependent on the DTD used, thus reducing the reusability of this 
component. In the second case, the raw XML builder is simple to implement by adding tags to the parsed 
DICOM attributes. An XSLT engine is applied with a SR XSL stylesheet to produce the final SR XML 
document. There are numerous XSLT engines available, either commercial or free. In this case, most 
application-specific logic resides in stylesheets and therefore maintenance is simplified. For this reason 
we selected the second approach. 



4. Component Selection 

This engine is developed under a Microsoft Visual C++ 6.0 environment, which limits the component 
selection to C or C++ versions of available tools. 
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The MergeC0M3 toolkit (version 3.0) [2] is a widely used commercial product for DICOM implemented 
in ANSI C from Merge Corp. Philips Medical Systems has purchased a corporate license. For this reason 
we choose the MergeCOM3 toolkit for this implementation. 

There are a number of XML parsers currently available, many of which are open source and can be freely 
used. In order to simplify engine implementation and comply with XML standards, a C-H- XML parser, 
Xerces version 1.1, from Apache Software Foundation [9], is selected. 

5. Component Diagram 



DIC0M2XML 




Figure 4. Component diagram of DICOM2XML 

Figure 4 shows the component diagram of the SR DIC0M2XML translator. DIC0M2XML, the main 
component, processes the command line and invokes the different components sequentially. The DICOM 
parser loads a DICOM binary file and parses it using the Merge DICOM toolkit. The XML builder 
dumps the result from the DICOM parser into a raw XML document following a set of rules. And then 
the XSLT engine takes the raw XML document and a SR XSL stylesheet to generate a SR XML 
docxmient. 
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The module design of the DICOM parser is as follows: 



DIC0M2XML 



diconiFileNaine : stnng 
rawXMLFileName : string 
stylesheetFileName : string 
targetXMLFiieName : string 



DICDM2XML0 
InitializeMergeComLibO 
ReleaseMergeComlibO 
LoadDICOMFileO 
GetMessageldO 
DICDMPars erAndXMLBuilderO 

XSLTProcessorO 



Figure 5. Class diagram of DICOM2XML module 

Figore 5 Aow. die compWe member operatton. and variab.es of tt>e DICOM2XML module. The 
detailed descriptions are as follows: 

dicomFileName^^^^ is the path name of a SR DICOM binary file. 

tareetXMLFileName 

A string which is the path name of an SR XML document. 

rawXMLFCeName j 

A string which is the path name of a raw XML document. 

stylesheetFdeNai^^^ is the path name of an SR XSL stylesheet. 

''''^''™ctor of the class DIC0M2XML. It takes three parameters, SR DICOM binary input, SR 
XSL stylesheet, and targets SR XML document. 

■""^WuShtSeCOM .ooUd. library. I. remms «e if i. is sucoessM. Omerwise, i. re«™ 

false. 

"'""'I'rSS'.J^^.y used by MergeCom toolld.. I. remms ttue if it is successful. Oherwise, it 

returns false. 

LoadDICOMHteO^ DCIOM binary file arrd converts it irtto a DICOM message object. 

"Sv^S^^S^frSa message object and writes tbe parsed DICOM auributes to an XML 

document. 
GetMessageldO 
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Returns the DICOM message object identification. 
'^''^Sl tt>e raw XML — . inu, an SR XML document by applying an SR XSL 



Stylesheet. 



6. Implementation 



Tl.e software for the DIC0M2XML engine includes one header file and two implementation files: 
DIC0M2XML . h - header file defining all the member functions of the translator. 
DIC0M2XML . cpp - implementation of DIC0M2XML . h. 

DIC0M2XMLMain.cpp- main program. 
CompilingDICOM2XML.handDICOM2XML.cppproducesastatic library XML2DXC0M.l^ 

Sroiam calls the fimctions in this library to do the translation. 
The rules by which a raw XML file is generated are as follows: 
1 The root element ofthe raw XML document is named 'report'. 

attribute: CodingScheme, Codeld, and ValueType. 

3. A DICOM Item or Sequece Item contains other DICOM attributes. Rules 2 and 3 are applied 

recursively. . T\^nr\\i{ 

oc f^ilnw*:- ra^ Chanse all the upper case letters m the UlCUM 

4. The element name is ^^^^ between two words in the name with an 
attribute name to lower case, (b) ^^l^'^^^^^Xir^XReDla^ hyphen ( - ) and slash ( / ) with 
underscore; (c) Remove apostrophes and brackets, (d) Replace nypnen ^ ; 

underscore (_ ). ^ . 

5. The value „t"codtagSche«e is fixed as "DCMTAO". m value of Codeld and value of Value are 
directly taken fi:om the DICOM parser. 

6. The value of ValueType is determined fi-om the mapping table below: 



DICOM Data Type 

AE, AS, CS, DA, DS, 
DT, IS, LO, LT, PN, 
SH, ST, TM, UT, UI, 
TJN. OB. OW, PL 



SR XML ValueType 

string 




These rules are compliant with the SR XML DTD described in [8]. 
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7. SR XSL Stylesheet 

A set of XSL s^^.^ is «a.ed to-^^TS^t^iitr "ST^te same 
definition (lOD) modules: basic text SR, enhanced SR, and comprenens 
fXTtion Jties (ffis) and modules as shown m the foUowmg table. 




Study 

Series 
Equipment 



Document 



Module 

Patient 



Specimen Identification 
General Study 



Patient Study 
SR Document Series 
General Equipment 



SR Document General 



SR Document Content 



SOP Common 



stylesheets in the three lOD modules. 

The high level structure of the SR XSL stylesheets is as follows: 

<xsl : include href = "Patient_IE . xsl " /> 
<xsl : include href = " Study_IE . xsl " /> 
<xsl-include href =''Serie s_IE. xsl"/ > 
^xsl include href=-Equipment IE.xsl"/> 
:xsl : include ^ref ="Document_IE^siy > 
<--- template for top-level element > 
<xsl : template match= "report " > 

<SRDocument> „^»„„r-h id"><xsl :value-of select="®id"/> 

<xsl: attribute '^^«= '^«P^^^-^tte"><xsl- value -of select="®date"/> 
<xsl -.attribute name=" report date ><^^i.vai 

<xsl :call-template ^^^^''^f ^^"^^^ //^ 
<xsl : call -template «ame= Study IE /> 
<xsl: call -template name="Series_IE /> 

<xsl:call-template ^^^'H'^^^jf^/j' 
<xsl: call -template name="Dccument_IE /> 

< / SRDocument > 
</xsl:template> 

The P«. S^yJB, Sertes_m, and ^^Xil.T^^^^"^ " '^'^ 
Stylesheets. The following shows the core part of Pat ient_IE tempia 

Patient_IE.xsl: 
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<xsl: template nanie="Patient_IE"> 
<Patients> 

<Patient> 



:xfl: call-template name="patxentxd template /> 
<xsl : call-template name="patients_Dircn aa r 
:x:i:call-ten|late name=«pat ents s«^^^ 
<xsl:apply-templates ^P^eSatient Id3"/> 

<xli • S?S-templates selects "ethnic_group"/> 
<xli;S?S-templates select="patient_comments /> 

</Patient> 
</Patients> 
</xsl:template> 

Stylesheets: SR„Docuiuent_General_Module . xsi, 3K_uo 
and SOP_Common . xsl: 

-t Ar.n^.^r{f^ href="SR Document General_Module.xsl"/> 
Tsi; rciSi Sff="L:Document:Con^^^^^^ 
<xsl- include href ="SOP_Common_ Module. xsl /> 
<xsl : template name= " Document_IE " > 
<Document> 

<SRDocumentGeneral>^^^^^ name=-SR_Document_General_Module"/> 

< / SRDocumentGener al > 

<SRDocumentContent>^^^^^ name= - SR_Document_Content_Module " / > 

</SRDocument Content > 

^'°^™sl:call-templatename="SOP_Common_Module"/> 

</SOPCommon> 
</Docvunent> 
</xsl : template> 

H ff^«.t lODs is in the SR Document Content Module. Each SR lOD has a 
The difference among the different SR iUUs is in inc ojv 
separate SR Document_Content_Module.xsl. 



8. Test Results 

. ^ fi. ^ ^R XML2DIC0M engine [6] are used, so both the original SR XML 
input to the SR XML2DIC0M engine. 

9. Conclusions 

currently mostDICOMreports/archivesarem^^^^^^ 

distribution. An engine convertmg medical ^«P°^^^^°^^^^° p^^^ support. Thus medical 
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to be processed in a structured way allowing anticipated uses ranging from outcomes analysis to 
epidemiological research. 

The design scenario we chose uses COTS XSLT components, it is more flexible and can easily adapt to 
updated versions of XSLT engines. In the current implementation, the Xalan XSLT engine from Apache 
is used, but it can easily be replaced by others. 

The rules to dump DICOM binary files into raw XML documents are the same rules as in mapping the 
SR UML model to XML representation [8], which makes the creation of the SR XSL stylesheets more 
convenient. The resulting modularized XSL stylesheets are easy to maintain, and can be reused other 
DICOM services. 

The test results showed that this DIC0M2XML translator works very well. The next step is to build a 
software component based on this prototype and integrate it into a DICOM data server linked to DICOM 
archives. 



11 



Company Restricted 



10. References 

[1] Digital Imaging and Communications in Medicine (DICOM), Supplement 23: Structured Reporting 
Storage SOP Classes, Final Text, DICOM Standards Committee, Rosslyn, VA, April, 2000. 

[2] DICOM toolkit documentation, Merge Technology Inc, http://www.merge.com . 

[3] Document Object Model (DOM), http://www.w3c.org/D0M . 

[4] Extensible Markup Language (XML) 1.0 W3C Recommendation lO-February-1998, 

http://www.w3c.org . 

[5] Extensible Stylesheet Language (XSL), http://www.w3c.org/Stvle/XSL . 

[6] J. Hu, Prototyping of SR XML to DICOM engine, TN-2000-050, Philips Research USA, Nov 2000. 
[7] Object Management Group, http://www.omg.org . 

[8] A. Tirado-Ramos & J. Hu, UML Modeling and XML DTD representation of DICOM StrucUired 
Reporting, TR-2000-019, Philips Research USA, Aug 2000. 

[9] Xerces XML Parser, Apache Software Foundation, http://xml.apache.org . 



12 



Company Restricted 



11. Glossary 



CORB A Common Obj ect Request Broker Architecture 

DICOM Digital Imaging and Communications in Medicine 

DTD Document Type Definition 

IE Information Entity 

lOD Information Obj ect Definition 

PACS Picture Archiving and Communication System 

SR Structured Reporting 

XML Extensible Markup Language 

XSL Extensible Stylesheet Language 

XSLT XSL Transformations 



13 



